Libraries tagged by HTML extraction
j0k3r/php-readability
485976 Downloads
Automatic article extraction from HTML
atrox/matcher
83307 Downloads
Powerful XML and HTML matching and data extraction library
dotpack/php-boiler-pipe
4554 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages
gregpriday/laravel-zyte-api
13 Downloads
A Laravel package for seamless integration with Zyte's web scraping API, offering functionalities for extracting raw HTML, browser-rendered HTML, and structured article content.
vanry/readability
35 Downloads
Automatic article content extraction from HTML, plus an HTML parser.
sleimanx2/grawler
295 Downloads
A guided HTML crawler with media meta extraction
ncjoes/pdf-suite
232 Downloads
A high-level wrapper over Poppler-Php for PDF content extraction and conversion using Poppler utils
aspose/pdf
34 Downloads
A powerful library for manipulating and converting PDF files.
anshu-krishna/html-scraper
12 Downloads
A set of PHP classes to simplify data extraction from HTML.
matejch/html_helpers
5 Downloads
Helper class for removing elements and content, and extracting file paths
clientbg/php-boiler-pipe
35 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages. Based on dotpack's PHP implementation.
ahadabasi/php-readability
1 Download
Automatic article extraction from HTML
hstanleycrow/easyphparticleextractor
7 Downloads
Free PHP library to extract the main content from an article post or news post, including images and HTML
einfacharchiv/microdata
90 Downloads
Extract billing data from HTML (supporting Microdata and JSON-LD)
ouxsoft/livingmarkup
3830 Downloads
A Processor for Markup written in PHP. Allows extraction of Markup into a data structure, orchestrated nested manipulation of said structure, and output as (optimized) Markup.